Lexical bundles in computational linguistics academic literature
نویسنده
چکیده
Corpus linguistics has been in the spotlight for the last decade, with the usage of modern computers and technologies deeper understanding of languages can be obtained. Corpus linguistics helps language teaching for acquiring a better view for language. Language teachers will know what sequence of words and patters tend to co-occur. Unlike previous enormous lists of words which students were forced to memorize them, and were seldom used and typically were forgotten in a short period of time lexical bundles can help language teachers to teach more effectively, and learners can be more fluent in the second language. The first studies in lexical bundles include Firth 1964 (Firth, J. R. (1964).) Biber (2004), (Cortes, V. (2004). Native speakers use a formulaic pattern of speech which they are unaware of but learners from other languages use bundles that are affected (transferred) bye their mother tongue and this solely can be problematic and easily recognizable. moreover in academia, when speakers of source language trying to write, publish, and produce academic literature in the target language their lack the fluency and native like features of a native speaker of that language. Lexical bundles (or as called N-grams) are crucial in getting a fluent academic text. In this paper lexical bundles of 1 to 5 tokens from an 8 million word corpus of academic literature from the Computational Linguistics field and its sub topics such as: Speech recognition, Natural Language Processing, Machine Learning, and Information Retrieval have been extracted and analyzed. On the top of that most of typical criteria for exclusion has been applied to the list as well as calculating MI factor for each result to confirm the results and reaching the target bundles for Computational Linguistics.
منابع مشابه
ACADEMIC WRITING REVISITED: A PHRASEOLOGICAL ANALYSIS OF APPLIED LINGUISTICS HIGH-STAKE GENRES FROM THE PERSPECTIVE OF LEXICAL BUNDLES
Lexical bundles are frequent word combinations that commonly appear in different registers. They have been the subject of much research in the area of corpus linguistics during the last decade. While most previous studies of bundles have mainly focused on variations in the use of these word combinations across different registers and a number of disciplines, not much research has been done to e...
متن کاملPublished vs. Postgraduate Writing in Applied Linguistics: The Case of Lexical Bundles
Abstract: Lexical bundles, as building blocks of coherent discourse, have been the subject of much research in the last two decades. While many of such studies have been mainly concerned with exploring variations in the use of these word sequences across different registers and disciplines, very few have addressed the use of some particular groups of lexical bundles within some gen...
متن کاملThe Use of Lexical Bundles in Native and Non-native Post-graduate Writing: The Case of Applied Linguistics MA Theses
Connor et al. (2008) mention “specifying textual requirements of genres” (p.12) as one of the reasons which have motivated researchers in the analysis of writing. Members of each genre should be able to produce and retrieve these textual requirements appropriately to be considered communicatively proficient. One of the textual requirements of genres is regularities of specific forms and content...
متن کاملA Comparative Study of Lexical Bundles in Soft Science Articles Written by Native and Iranian Authors
Writing academic texts by novice researchers requires a framework and support by learning how to cite the works of others. However, compared to the studies on other academic writings, studying citations by considering certainty markers has received little attention. The main purpose of this study was to investigate the shifts of certainty markers (hedges and boosters) in pre- and post-citation ...
متن کاملLexical Bundles in English Abstracts of Research Articles Written by Iranian Scholars: Examples from Humanities
This paper investigates a special type of recurrent expressions, lexical bundles, defined as a sequence of three or more words that co-occur frequently in a particular register (Biber et al., 1999). Considering the importance of this group of multi-word sequences in academic prose, this study explores the forms and syntactic structures of three- and four-word bundles in English abstracts writte...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- CoRR
دوره abs/1603.02905 شماره
صفحات -
تاریخ انتشار 2016